Speeding-Up Action Learning in a Social Robot With Dyna-Q+: A Bioinspired Probabilistic Model Approach

Abstract

Robotic systems developed for social and dynamic environments require adaptive mechanisms to operate successfully. Consequently, learning from rewards has provided meaningful results in applications involving human-robot interaction. In those cases where the robot's state space and the number of actions are extensive, the dimensionality becomes intractable, and this drastically slows down the learning process. This effect is especially noticeable in one-step temporal difference methods, because just one update is performed per robot-environment interaction. In this paper, we show how the action-based learning of a robot can be improved by combining classical reinforcement learning methods, such as Q-learning or Q(λ), with a probabilistic model of the environment. This architecture, which we have called Dyna, allows the robot to simultaneously act and plan using the experience obtained during real interactions. Principally, Dyna improves classical algorithms in terms of convergence speed and stability, which strengthens the learning process. Hence, in this work we have embedded a Dyna architecture in our robot, Mini, to endow it with the ability to autonomously maintain an optimal internal state while living...
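To make the act-and-plan idea concrete, the following is a minimal sketch of a generic tabular Dyna-Q loop, not the authors' implementation. It assumes a deterministic lookup-table model rather than the probabilistic model the abstract mentions, and it omits the exploration bonus for long-untried state-action pairs that distinguishes Dyna-Q+. The function names `dyna_q` and `corridor_step`, the six-state corridor, and all parameter values are hypothetical choices made for illustration.

```python
import random
from collections import defaultdict


def dyna_q(step, actions, start_state, episodes=50, planning_steps=10,
           alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Dyna-Q sketch: each real step is followed by simulated updates."""
    rng = random.Random(seed)
    q = defaultdict(float)   # Q-values keyed by (state, action)
    model = {}               # assumed deterministic: (s, a) -> (r, s2, done)

    def greedy(state):
        # Pick a maximizing action, breaking ties at random.
        best = max(q[(state, a)] for a in actions)
        return rng.choice([a for a in actions if q[(state, a)] == best])

    def update(s, a, r, s2, done):
        # One-step Q-learning (temporal difference) update.
        target = r if done else r + gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (target - q[(s, a)])

    for _ in range(episodes):
        s, done = start_state, False
        while not done:
            a = rng.choice(actions) if rng.random() < epsilon else greedy(s)
            r, s2, done = step(s, a)           # real interaction
            update(s, a, r, s2, done)          # learn from real experience
            model[(s, a)] = (r, s2, done)      # remember what happened
            for _ in range(planning_steps):    # plan with remembered experience
                ps, pa = rng.choice(list(model))
                pr, ps2, pdone = model[(ps, pa)]
                update(ps, pa, pr, ps2, pdone)
            s = s2
    return q


def corridor_step(state, action):
    """Hypothetical toy task: states 0..5, reward 1.0 on reaching state 5."""
    next_state = min(max(state + action, 0), 5)
    done = next_state == 5
    return (1.0 if done else 0.0), next_state, done


q_values = dyna_q(corridor_step, actions=[-1, 1], start_state=0)
```

Setting `planning_steps=0` reduces the loop to plain one-step Q-learning, with a single update per real interaction. That is the slow case the abstract describes, so comparing the two settings on the same task is one way to see the effect of planning.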

Similar resources

Speeding Up the Learning in A Robot Simulator

Q-learning is one of the best-known reinforcement learning algorithms and has been widely used in various problems. The main contribution of this work is to show how to speed up learning in a single-agent environment (e.g., the robot). In this work, an attempt is made to optimize the traditional Q-learning algorithm using the Repeated Update Q-learning (RUQL) algorithm (the recent state...

The Relationship Between Language and Social Capital in Ilami Kurdish: A Sociopragmatic Approach

Abstract: Language, as a means of creating and rebuilding social capital, has received attention over the past few decades. Although much has been written about social capital and its related constructs, very little research has examined how language can create trust or distrust. This study was carried out to achieve two goals. First, an attempt will be made to develop a typology of the vocabulary that the Kurdish-speaking people of the city of ...

Designing a Social Banking Model with a Post-Corona Approach

The first part of the economic system that was affected by the outbreak of the Corona pandemic was the banking system of countries. Therefore, the aim of this study was to design a social banking model with a post-corona approach in the country's banking industry, which uses a combination of Delphi-fuzzy method and interpretive structural modeling. In this study, the opinions of university prof...

Locking in Returns: Speeding Up Q-Learning by Scaling

One problem common to many reinforcement learning algorithms is their need for large amounts of training, resulting in a variety of methods for speeding up these algorithms. We propose a novel method that is remarkable both for its simplicity and its utility in speeding up Q-learning. It operates by scaling the values in the Q-table after limited, typically small, amounts of learning. Empirical...

A Developmental Approach to Robot Action Learning

An open question in robot action learning is how robots can detect relevant features of demonstrated actions. Robots that have no knowledge of either the action or the environment face the problem of not knowing what to detect and where to attend. Inspired by human parent-infant interaction, we suggest that parental action directed at infants can assist robots, as well as infants, in detecting important a...

Journal

Journal title: IEEE Access

Year: 2021

ISSN: 2169-3536

DOI: https://doi.org/10.1109/access.2021.3095392